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ticle physics. Nevertheless, each may be violated by simple modifications to the dynamics 
while apparently preserving the essential fundamental structure of quantum field theory 
itself. In this paper, we analyse the construction of strong equivalence, Lorentz and CPT 
violating Lagrangians for QED and review and propose some experimental tests in the 
fields of astrophysical polarimetry and precision atomic spectroscopy. In particular, mod- 
ifications of the Maxwell action predict a birefringent rotation of the direction of linearly 
polarised radiation from synchrotron emission which may be studied using radio galaxies 
or, potentially, gamma-ray bursts. In the Dirac sector, changes in atomic energy levels are 
predicted which may be probed in precision spectroscopy of hydrogen and anti-hydrogen 
atoms, notably in the Doppler-free, two-photon Is — 2s and 2s — nd (n ~ 10) transitions. 
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1. Introduction 

The quantum field theories used to describe elementary particle physics incorporate a 
number of basic principles including Lorentz invariance, unitarity, causality, locality and 
CPT symmetry and, in the presence of gravity, the weak and strong equivalence principles. 
Although there are no compelling theoretical or experimental reasons to question these 
principles, it is nevertheless interesting to speculate on the experimental consequences 
should any of them turn out to be violated in nature. 

In this paper, we consider modifications of the dynamics of QED exhibiting either 
violations of the strong equivalence principle or of Lorentz and CPT invariance and dis- 
cuss certain experimental signatures, notably in the surprisingly related fields of precision 
hydrogen and anti-hydrogen spectroscopy and polarimetry of astrophysical sources such as 
radio galaxies or gamma-ray bursts. 

The weak equivalence principle requires the existence at each spacetime point of a 
local incrtial frame of reference. This is fundamental to the structure of general relativity 
and is realised by formulating spacetime as a (pseudo-)Riemannian manifold. We will keep 
this as the basis for quantum field theory in curved spacetime. The strong equivalence 
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principle, however, apparently has a rather different status, being simply a restriction on 
the dynamics of the theory. It requires that the laws of physics (i.e. dynamics) are iden- 
tical in each of these local inertial frames, where they reduce to their special relativistic 
form. Precisely, this requires the matter fields to couple to gravity through the connec- 
tions only (minimal substitution), not through direct couplings to the curvature tensors. 
Strong equivalence violation can therefore be realised by including in the QED Lagrangian 
interaction terms coupling the electromagnetic or Dirac fields directly to the curvature, 
e.g. l/M'^Rni.xpFf^^F^f^ or l/]VpRniyipj^^D'^i/j. M is a characteristic mass scale for strong 
equivalence violation and is to be determined, or bounded, from experiment. 

In the electromagnetic sector, it is known from the original work of Drummond and 
Hathrell[l] that terms of this type arise in the effective action for QED in curved spacctime 
as a result of integrating out electron loops. In this case, the mass scale M is simply the 
electron mass m and the coefficients are of order a, the fine structure constant. In previous 
work[l, 2, 3, 4, 5, 6], we have investigated at length the implications of this effective 
quantum-induced violation of the strong equivalence principle for photon propagation. By 
far the most interesting result is the apparent prediction of superluminal propagation of 
light. A careful review of this effect and its implications for causality has been given 
in refs.[6, 7], where it is shown that in general relativity, superluminal propagation may 
in fact be compatible with stable causality. The situation is complicated by the fact 
that the full effective action for QED in curved spacetime[6, 8] involves interactions and 
form factors depending on spacetime covariant derivatives in addition to the terms in 
the original Drummond- Hathrell action, e.g. l/M^Rf^^xpDaF'^^D^F^^'^ . This implies that 
light propagation is dispersive and the analysis of causality requires the identification of 
the relevant 'speed of light', which turns out to be identifiable as the high-frequency limit 
of the phase velocity [9]. 

In section 2, we construct a general strong equivalence violating QED Lagrangian in 
both the photon and electron sectors, keeping interactions of all orders in derivatives but, as 
above, only of first order in the gravitational curvature. The photon part of the Lagrangian 
is essentially the same as the effective action derived in ref.[8]. The electron Lagrangian, 
however, is quite new, although related to previous work on the propagation of massless 
neutrinos in a background gravitational field [10]. 

It is important to realise that, unlike the photon case, this electron Lagrangian does not 
have an interpretation as a low-energy effective action for QED - there is no way of 'integrat- 
ing out the photon field' to leave a local, infra-red finite effective action describing electron 
propagation in curved spacetime. Nevertheless, we can relate our strong equivalence violat- 
ing generalisation of the Dirac Lagrangian to previous analyses of the electron propagator 
in curved spacetime (more precisely, the evaluation of the electron matrix elements of the 
energy-momentum tensor) incorporating one-loop self-energy contributions [11, 12, 13] 

Lorentz and CPT violating dynamics is introduced in a similar way. We begin as 
usual with the standard formulation of QED in Minkowski spacetime, but include tensor 
operators in the action multiplied by multi-index coupling constants, e.g. K^^Xi^F^^^ F^P or 
c^yTp^^D^'ip. Since these couplings are simply collections of constants, not tensor fields in 
their own right, Lorentz invariance is explicitly violated in the dynamics of this modified 
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QED. Although it is not necessary for this phenomenological approach, we can imagine 
these couphngs to be Lorentz-violating VEVs of tensor fields in some more fundamental 
theory. This approach to Lorentz and CPT violation has been pioneered by Kostelecky[14], 
who has constructed the most general Lorentz-violating generalisation of the standard 
model incorporating renormalisable (dimension < 4) operators. In an extended series of 
papers, Kostelecky and collaborators have examined in detail a great variety of potential 
experimental signatures of these new interactions. For a review, see, e.g. ref.[15]. 

The analogy between these Lorentz and CPT violating Lagrangians and the strong 
equivalence violating QED Lagrangian is clear, with, for example, the coupling constant 
Kfj,i/Xp playing the role of the Riemann tensor RfiuXp- It is therefore easy to translate aspects 
of the phenomenology of one theory to the other. As we shall see in section 4, this leads 
to some new insights and economy of analysis. 

A novel feature of the Kostelecky Lagrangian is its inclusion, as a subset of the Lorentz- 
violating terms, of interactions that are CPT odd. We recall that CPT invariance is a 
general theorem of quantum field theory assuming the general principles of Lorentz in- 
variance, locality and microcausality are respected (see, e.g. ref.[16]. The Kostelecky La- 
grangian evades the CPT theorem by explicitly breaking Lorentz invariance. It therefore 
provides an interesting arena in which to study the possible experimental signatures of 
CPT violation. This is particularly important at present in view of the current experi- 
ments on anti-hydrogen spectroscopy which are designed to test the limits of CPT with 
unprecedented accuracy. 

In the second part of this paper, we turn to some possible experimental consequences 
of these novel phenomenological Lagrangians. As wc have already extensively investigated, 
the strong equivalence violating modifications to the QED Lagrangian have important 
consequences for photon propagation. The light cone is modified and in many cases super- 
luminal propagation becomes possible. Moreover, if the Weyl tensor CpuXp (the trace-free 
part of the Riemann tensor) is non-vanishing, the effective light cone acquires a depen- 
dence on the polarisation, so that the speed of light becomes polarisation dependent. This 
is the phenomenon of gravitational birefringence. Wc immediately realise that the same 
birefringent effect will also arise with the Lorentz-violating Lagrangian [14], provided the 
coupling K^i^xp has a trace-free component analogous to Cp^^xp- The Newman-Pcnrose for- 
malism, which was found to be an elegant way of analysing results in the strong equivalence 
violating case, may similarly be usefully adapted to the Lorentz-violating Lagrangians. 

One of the most important physical manifestations of birefringence is the rotation of 
the plane of linearly polarised light as it passes through the birefringent medium, in our 
case either the background gravitational field or, in the Lorentz-violating scenario, simply 
the vacuum. Since we expect any strong equivalence, Lorentz or CPT violations to be tiny, 
detection of this birefringent rotation requires experiments with the longest possible base- 
line, since the rotation angle will be proportional to the distance the light has propagated 
through the birefringent medium. This leads us to focus on astrophysical sources which 
emit radiation with a well-defined orientation of linear polarisation. A thorough analysis 
of this effect for the case of the CPT-violating Chern-Simons interaction L^^€py\pA^ F^'' in 
the Kostelecky Lagrangian was performed some time ago by Carroll, Field and Jackiw[17], 
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who were able to bound the couphng by placing limits on such a birefringent rotation 
from observations of synchrotron emission from radio galaxies. In section 4, we briefly 
review this and more recent work by Kostelecky and Mewes[18], then go on to discuss the 
possibility of improving these bounds by considering higher redshift sources, in particular 
by exploiting the linear polarisation observed in the afterglow of gamma-ray bursts to look 
for a Lorentz-violating birefringent rotation. 

Curiously, while looking for strong equivalence, Lorentz and CPT violation in the 
photon sector requires experiments on astronomical scales, testing the electron sector is the 
domain of precision atomic spectroscopy. Even more intriguingly, since symmetry violation 
in the photon sector implies, through radiative corrections, a corresponding violation in the 
electron sector (and vice- versa), experimental bounds on the Kostelecky photon couplings 
KfjiuXp and L'^ and the electron couplings afj_,bu, . . . , h^^ are in principle correlated. That 
is, bounds on Lorentz and CPT violation found from astrophysics may also constrain the 
violation of these symmetries in the experiments on anti-hydrogen spectroscopy. However, 
there are some caveats here related to subtle anomaly-related issues in QED[19], so the 
situation is actually not quite so clear-cut (see section 3). 

In section 5, therefore, we address the question of strong equivalence, Lorentz and 
CPT violation in the electron sector of QED, using the appropriately modified Dirac equa- 
tion to recalculate atomic energy levels and their implications for precision hydrogen and 
anti-hydrogen spectroscopy. Following the production of significant numbers of cold anti- 
hydrogen atoms by the ATHENA[20, 21] and ATRAP[22, 23] experiments at CERN, the 
potential of performing precision measurements of the anti-hydrogen as well as hydrogen 
spectrum may be realised, and it is hoped that experiments will soon be underway to test 
CPT to high accuracy by comparing the frequencies of the Is — 2s transition in hydro- 
gen and anti-hydrogen. This transition is favoured because of the extremely narrow line 
width of the 2s state, since the only available decay is the Doppler-free, two-photon Is — 2s 
transition. However, within the framework of the Kostelecky Lagrangian, CPT violation 
would only be observable in the hyperfine Zeeman splittings of this transition for trapped 
anti-hydrogen [24], where the line widths would be subject to Zeeman broadening. Here, 
we present a slightly extended analysis of the energy level shifts predicted by the modified 
Dirac equation and propose an alternative measurement which in practice may be of com- 
parable accuracy, namely the Doppler-free, two-photon 2s — nd (n ~ 10) transition for 
hydrogen and anti-hydrogen. 

2. Strong Equivalence Violating QED 

The weak equivalence principle requires the existence of a local inertial frame at each 
spacetime point. This is realised by making spacetime a (pseudo-)Riemannian manifold, 
which admits a fiat tangent space at each point, related to spacetime via the vierbein e^a 
(where a is the tangent space index). In quantum field theory in curved spacetime, the 
fundamental fields are defined on this tangent space. 

The strong equivalence principle is invoked to constrain the dynamics. It requires 
that the laws of physics are the same at the origin of the local inertial frame at each 
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spacetime point, where they reduce to their usual special relativistic form. This is realised 
by requiring the coupling of the fields to gravity to involve only the spacetime connection, 
not directly the curvature tensors. It is equivalent to minimal substitution, where the 
general relativistic equations of motion are found from the special relativistic ones by 
substituting the covariant derivatives for ordinary derivatives. The same principle is of 
course already used to construct the usual QED Lagrangian itself - the electron field tp 
is coupled to elcctromagnetism via the gauge covariant derivative only, with the (non- 
renormalisable) operators coupling 't/^ip bilinears directly to the field strength F^i, being 
omitted. The conventional QED action is therefore simply 



(2.1) 



where D^tp = {d^ — ^ujfiab^^ + i^^^)^ is the covariant derivative, involving the spin 



connection ui^ab = &\a{d^ie^b + ^^v^'^b)- Here, a"^ = |[7",7^], and in what follows we shall 



usually abbreviate 7'^ = e^al""- 
2.1 Extended Maxwell action 

The strong equivalence principle is violated by the introduction of direct coTiplings to 
the curvature tensors Rfj^^xp, Ri_ii, and R, e.g. l/M'^R^^XpF^^ F'^'^ . This implies that the 
local dynamics now distinguishes between spacetime points, since it obviously depends 
on the curvature at each point. Since these new interactions necessarily involve non- 
renormalisable (dim > 4) operators, their inclusion introduces a new scale M, which in 
our phenomenological approach characterises the scale of strong equivalence violation. Its 
value is to be determined, or bounded, by experiment. 

We will only consider operators which are linear in the curvature. This is most natu- 
rally interpreted as keeping only the lowest-order terms in an expansion of a more general 
Lagrangian in 0(i?/M^), so we regard the resulting theory as a valid approximation for 
gravitational fields which are weak on the scale M. On the other hand, we will write 
down an action which involves all orders in covariant derivatives acting on the fields. In 
this case, keeping only terms of lowest order in 0(D^/M^) is a low-momentum (or low- 
frequency) approximation. This will nevertheless be a useful first step in understanding 
the phenomenology of strong equivalence violation. 

A systematic analysis of all possible 0{R/M'^) operators shows that the most general 
strong equivalence violating extension of the Maxwell sector of QED at this order has the 
action 



\F^,F^^- + -^(D^F^^>^toD,F 



V 

A 



+ RF^^FI""^ + R^uFt'^F^'x + R^.^xpFi''' F^P^ 
+ ■^{alRD^Ff'^D.F^ + Rf^^OxF^t'DpFP'' 
+ U R/^uD'^F^PD'^Fxp + Ri,,D''D>^FxpFP'' 
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+ ct R^.xpDaF'^PD^Fi'- ^ + -^bl R^^Di^D^F^'^D'^DpFP^ 
In this formula, the a^, 6„, are form factor functions of three operators, i.e. 



(2.2) 



r)2 r)2 r)2 

where the first entry {D'^i^) acts on the first following term (the curvature), etc. do is 
similarly defined as a single variable function. 

To establish eq.(2.2), extensive use has been made of the Bianchi identities for the 

electromagnetic field strength and the curvature tensors, viz. Dy^F^^^^ = 0, -D[o-R^i/]Ap = 
0, D^[R^u — ^Rg^y) = 0, the commutation relation [D^,Dy\ ~ 0{R) for covariant deriva- 
tives, and repeated use of integration by parts to show the equivalence in the action of 
different operators. As a simple illustration, notice for example that 

j d'^x R^^Di'D^'F^PFxp = - Jd^x Rf^^^D^'F^PD'^Fxp + \ j d^xD'^RF^PF^p (2.4) 

so the apparently independent operator on the l.h.s. is equivalent to a combination of the 
62 and 60 terms, taking account of the form factors.^ 

In applications of eq.(2.2), in particular to the analysis of birefringence in section 4, it 
is often convenient to introduce the Weyl tensor C^,/Ap- Indeed, it is only the Weyl tensor 
contribution which gives rise to birefringence. Explicitly, C^i^xp is the trace-free part of the 
Riemann tensor: 



Cp.i/Xp — RjxvXp 2^^t^^3yp Rp,p9uX RvX9np RypQ^iX) 

+ \R{g^lxgup - QfipQux) 



(2.5) 



In contrast to the Ricci tensor R^u, the Weyl tensor is not directly constrained by matter 
via the Einstein field equations. The two operators involving the Riemann tensor in eq.(2.2) 
can be re-expressed in terms of the Weyl tensor and combinations of the Ricci tensor and 
Ricci scalar terms as follows. (Note that these identities hold only under the integral and 
to 0{R).) 



j d'xV^ C^uXpF'^'^F^P = J d^x^g 



R/.^XpF^'^F^P 



2Rp,yF''^F''x + -RF'^'F, 



pv 

(2.6) 



and 



^Notice that in the effective action in ref.[6, 8], / d'^x Rf.^D'^ D" F^" Fxp and / d^x Ri.^D^' F^p D" Fxp 
were included as independent terms. The effective action quoted there can therefore be simplified. They do, 
however, arise from the quite different operators txR^^D^ D'^ PP and tTRi^i,D'^'R.'^P D'^'R,\p in the Barvinsky 
et al. action for general background fields [25, 26]. 
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~Rf,,DxF^''DpFP'' - R^^D^'D^FxpFP" 

(2.7) 



-^RDfj^F'^^D^F''^ - ^RD'^F^/'Fxp 



While in this paper we axe considering eq.(2.2) as a phenomenological Lagrangian ex- 
hibiting violation of the strong equivalence principle, it is important to realise that the 
same expression (with M interpreted as the electron mass) arises in conventional QED 
in curved spacetime as the effective action for the electromagnetic field when we include 
one-loop vacuum polarisation to integrate out the electron field. It is therefore the appro- 
priate action to use to study photon propagation in curved spacetime, taking into account 
quantum effects at 0{a). There is just one difference - at one-loop, the quantum effective 
action does not have the 0(1/M^) term that we have included above. Exact expressions 
for the form factors derived in QED at 0{a) are known and are given in ref.[8]. 

2.2 Extended Dirac action 

A similar analysis can be carried out to find the most general free Dirac action compris- 
ing strong equivalence breaking operators of first order in the curvature but all orders in 
derivatives. A systematic study of all the possibilities shows that the following is a complete 
basis: 



+^ ( iR^^Dt^^^.DD-^ + r5 iR^uDxiDi^ipl^D-^l;) ) 
+^ iRp,Di^Dxi^a>^PD-Dpi^ +h.c. 



(2. 



where as before the 7 ni ^ n are form factor functions of derivatives. 

A notable feature of eq.(2.8) is that there is no independent term involving the Riemann 

tensor. That is, there is no possible term in the action that can be built from the trace-free 
Weyl tensor. This follows relatively straightforwardly from the symmetries of R^vXp and 
the (Bianchi) identity DPRpi,xp = DxRvp — DpRi,x- The independence of this Dirac action 
on the Weyl tensor is in marked contrast to the Maxwell action. 

At first sight, it might be thought that just as the extended Maxwell action can be 
realised as the quantum effective action governing photon propagation in the presence of 
vacuum polarisation, this extended Dirac action could similarly be realised as an effective 
action for electrons incorporating one-loop self-energy corrections in the presence of gravity. 
In fact, there is no local effective action which accomplishes this[13]. The reason is that 
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whereas it makes sense to write a low-energy effective Lagrangian valid below the scale of 
the electron mass, it is not possible to do the same with the massless photon. If we try, we 
find that the the form factors fi,gi, hi in the Dirac action cannot be local, i.e. polynomial 
functions in D^. As we see below, their Fourier transforms include infra-red singular 
logarithms of momentum. 

On the other hand, a slight generalisation of eq.(2.8) to chiral fermions is a good effec- 
tive action for the propagation of neutrinos in curved spacetime. In this case, the relevant 
one-loop self-energy diagrams involve the W or Z boson propagators and the quantum 
corrections can be encoded at low energies by an effective action for the neutrinos alone, 
with the strong equivalence breaking scale M being identified with the weak scale mw- 
An analysis of neutrino propagation in curved spacetime has been carried out some time 
ago by Ohkuwa[10], generalising the results of Drummond and Hathrell[l] for photons. To 
lowest order in curvature and for low-momentum neutrinos, he finds that the h i, "^2; 7^3 
and ~g^i operators, modified to include left-handed chiral propagators and with constant 
coefficients of 0{a/mw), are sufficient to encode the one-loop self-energy corrections, and 
computes the 51(0, 0,0) coefficient (the only one which affects the neutrino velocity). Just 
as in the photon case, this allows the possibility of superluminal propagation of (massless) 
neutrinos in certain spacetimes, notably the FRW universe. However, whereas there is 
a bircfringcnt shift in the photon velocity (with one polarisation being superluminal) for 
Ricci-flat spacetimes such as the Schwarzschild black hole, the neutrino velocity remains 
equal to the speed of light in this case. Our construction of the general Dirac action (2.8) 
shows that in fact this is a general result, valid beyond the low-momentum approximation. 

Eq.(2.8) can easily be extended to include electron-photon interactions by promoting 
the spacetime covariant derivatives to be gauge covariant as well. As explained above, this 
is the conventional prescription of minimal substitution which, in ordinary QED, ensures 
a renormalisable Lagrangian. It could reasonably be argued, however, that in the spirit 
of constructing (non-renormalisable) strong equivalence violating phenomenological exten- 
sions of QED, we should also allow violation of the gauge minimal substitution principle as 
well and include operators coupling the electron bilinears ipip directly to the electromag- 
netic field strength F^i, as well as the spacetime curvatures. Examples of such interactions 
would be Ffj^iiitp^i^D'^^ or F^i,^a'^'^ip. Since we will not make use of such interactions in 
this paper, however, we will not present a classification of all such possibilities here. 

2.3 Energy-momentum tensor 

After reviewing the analogous formalism for broken Lorentz and CPT violation in the next 
section, we shall discuss the implications of these extended Maxwell and Dirac actions 
for the propagation of polarised light and for atomic spectroscopy in sections 4 and 5 
respectively. 

First, though, we shall evaluate the electron matrix element of the energy- momentum 
tensor using the Dirac action (2.8). This is an important object in several contexts, ranging 
from the study of general relativity as a low-energy effective theory of quantum gravity[13] 
to deep inelastic scattering, where the energy-momentum tensor arises in the operator prod- 
uct expansion of electromagnetic currents and the form factors are identified as structure 
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functions (the matrix elements being taken between proton rather than electron states). 
It is also important in determining the explicit expressions for the form factors in any 
application where the extended Dirac action can be viewed as a conventional low-energy 
effective action, e.g. in the generalisation to neutrino propagation. So while the discussion 
that follows is rather formal, the results may have applications in a number of interesting 
contexts. 

The energy-momentum tensor matrix element in flat spacetime is defined as 



'-9 Si/jSgauStp 



u{p) (2.9) 

F.T., 9^^=111X1, 



which follows immediately from the definition of T^^'^ as the functional derivative of the 
action with respect to the metric. Interpreted in perturbative quantum gravity, this is the 
electron-graviton vertex. 

Its evaluation therefore amounts to taking the metric variation of the various terms in 
the extended Dirac action. We therefore collect here some useful formulae for functional 
derivatives: 



+ y^^g^%g''KDr6ix,y) 



R{y) = - (^Ri^'^ + ^M'^zps _ Dl^^v^^ y) 



(2.10) 
(2.11) 



'JffMf (^) 



Rxp{y) = - 5 {g'^xg^pD^ - g^DxD'' - g^xD^D'^ + g'^'^DpDx) d{x, y) (2.12) 



where, importantly, the derivatives are w.r.t. x. 

Acting on spinor quantities, the metric derivative is to be re-interpreted in terms of 
the vierbein as 

5 1 



(2.13) 



Recalling that the vanishing of the covariant derivative of the vierbein 

Dxe% = dxe^ + TPxae^-iVxbce'"' = (2.14) 
defines the spin connection as 

io^.ab = exaid^e^b + ^^^ue^b) = exaDt^e^b (2-15) 
where D is covariant only w.r.t. the curved spacetime index, we can also show that 



where, again, the derivative is w.r.t. x. Note that in this formalism, the connection T"^ 



is considered as independent of the vierbein e^^a- With this interpretation, we can then 
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readily check that the energy-momentum tensor for the conventional Dirac action is simply 



rptJ,!/ 



-g ^9fiv 



/ d^x^/^ i/j{ij.D — m)ijj + h.c. 
m)tp 



(2.17) 



The matrix elements of each of the terms in eq.(2.8) can now be taken in turn. For those 
already involving a curvature tensor, the only contributions to the fiat spacetime matrix 
elements are clearly those arising from the variations of the curvature tensors themselves 
using eqs.(2.11),(2.12). For these terms, therefore, we do not need to take the metric 
variations of the form factors themselves. We do, however, need to take into account the 
variations of the h i form factors. 

The matrix element {p'\T^^{q)\p), or electron-graviton vertex, can be easily shown 
to have only three possible independent Lorentz structures in momentum space. Each 
of these individually satisfies the conservation constraint q^T^^ = 0, which follows from 
diffeomorphism invariance of the action, using the equations of motion for the on-shell wave 
functions u{p), u{p'). (The kinematical variables used here are (p'^) for the initial (final) 
electron momenta and q^ = p'^—p^ for the momentum transfer (graviton momentum)). We 
can choose them to be (7^P^ + 7^P^), Pf'P" and {q'^q'' - q'^g'^"), where P^ = \{p'^ +p^). 
The form factors "/^j, 'g'j^ reduce on-shell to single functions of the momentum transfer 
squared, i.e. 



f I 2 l2 2\ 



on— shell 



(2.18) 



To simplify notation, we also let hi{—q ) — q h[{—q ) = hi{q ). 

Notice that the set of Lorentz structures chosen here is not unique. Another frequently 
used basis replaces the term {-f^^P" + j'^P'^) by j^^{cr^^qxP'^ + a^^qxP^). The translation 
between the two can be made using the Gordon identity 



—uip') af'^qx uip) = uip') i-f'' - — P'^) uip) 
m \ m J 



(2.19) 



In order to simplify the analysis, wc now choose to identify the strong equivalence 
breaking scale M with the electron mass ra so that there is only one mass scale in the 
problem. This could easily be relaxed of course for a specific application. For example, for 
neutrino propagation we would take M = mw and neglect the neutrino mass compared to 
the weak scale. Putting all this together, we then find after an extensive calculation: 



(p'ir'^^(g)lp) = u{p') \U-i^'P'' + YP^) 



_)_ _L pM 



-2{hi + h2) 



I -ho 



2^ 



I 1 Q ~ 
91 + 4^53 



+h{q^Q'-q^9n 



{h, + h2) + 4(/i + /2-i^/4) 
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5^ (52+54 + 553) + 56 



u{p) 

(2.20) 

This expression shows clearly how the various possible terms in the general extended Dirac 
action contribute to the three independent form factors in the energy-momentum tensor 
matrix element. 

Expressions of this type have appeared several times in the literature as the 0(h) 
contributions of one-loop electron self-energy diagrams in QED in the presence of a weak 
perturbation g^i^ = rj^i, + h^y of the Minkowski spacetime metric, i.e. as the QED correc- 
tions to the electron-graviton vertex in perturbative quantum gravity. Explicit expressions 
for the functions Fi{q'^), F2{q^) and Fs{q^) defined as the functions in square brackets in 
eq.(2.20) have been given for QED by several authors[ll, 12, 13]. For small momentum 
transfer, the results are 

= 1- "4^-47 + 7^^^ + 2 ^ /2.21) 

47r \ 18 2 , /_„2 3 / ^ ■' 



4i^\ , (2 22) 



^3 = t(^-^^-lln^) + ... (2.23) 



The 0(1) term in Fi, which follows directly from the free Dirac action, can readily be seen 
to be a consequence of momentum conservation. 

These expressions illustrate the essential problem in regarding the extended Dirac 
action too literally as a low-energy effective action in the usual sense. The non-analytic 
terms \/ —q^ and ln(— g^) are the signature of the long-range interactions in QED mediated 
by the massless photon. These interactions clearly cannot be 'integrated out'. However, 
they do carry important physical information and, as shown recently in rcf.[13], can be used 
to reconstruct both classical and quantum corrections to the Kerr-Newman and Reissner- 
Nordstrom metrics. 

The presence of non-analytic terms in the form factors for the energy-momentum 
tensor matrix elements prevents us from using eqs.(2.21),(2.22) and (2.23) to reconstruct 

polynomial form factors ~§ ^ and h i in the extended Dirac action and view it as a true 
local effective action for QED in the same way as we have done for the extended Maxwell 
action. This programme could be carried out, however, with the chiral extension of the 
action where the scale M is taken as the vector boson mass m-w and the result interpreted 
as the effective action for neutrino propagation. In this case, the self-energy quantum 
corrections can indeed be encoded in an effective action of this type and a generalisation 
of eq.(2.20) could be used, as in the work of Ohkuwa[10], to constrain the ~§ ^ and hi 
form factors by comparing with the results of explicit Feynman diagram calculations of the 
electron-graviton vertex in a Minkowski spacetime background. 

3. Lorentz and CPT Violating QED 

Lorentz invariance and CPT symmetry are fundamental properties of conventional quan- 
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turn field theories. Ncvcrtlielcss, it is interesting to speculate that they may not be exact in 
nature and to study possible signatures for their violation, if only to stimulate increasingly 
precise experimental tests. A formalism for studying Lorentz and CPT violation within 
the framework of the standard model has been proposed and extensively explored in recent 
years by Kostelecky and collaborators[14, 15]. In this section, we briefly review this ap- 
proach, emphasising the close technical similarities to our own work on strong equivalence 
violation. 

In the Kostelecky approach, explicit Lorentz violation is introduced by writing a phe- 
nomenological Lagrangian comprising tensor operators with coupling constants carrying 
spacetime indices. These couplings are simply collections of numbers, not tensor fields 
in their own right. However, they may, though this is not at all necessary in the phe- 
nomenological approach, be thought of as Lorentz-violating VEVs of tensor fields in some 
more fundamental theory exhibiting spontaneous Lorentz violation. Restricting to renor- 
malisable interactions, the most general such extension of QED is therefore (changing the 
notation of ref.[14] slightly): 



With this notation, the couplings a^, b^,Lf^ of the super-renormalisable operators have 
dimensions of mass while c^j,y, d^i,, /i^^, K^^xp stre dimensionless. 

Here, we follow Kostelecky and restrict attention to the simple renormalisable La- 
grangian (3.1). Of course, nothing prevents us from introducing further higher dimensional 
operators, particularly those with extra derivatives analogous to eqs.(2.2) and (2.8). This 
would be in the spirit of regarding this phenomenological action as a low-energy effective 
action in a theory in which Lorentz symmetry is broken at a high scale M. 

Since Lorentz invariance is explicitly broken in (3.1), one of the axioms of the CPT 
theorem is not respected and therefore CPT symmetry is no longer guaranteed. In fact, the 
super-renormalisable operators with couplings a^, b^, L'^ are CPT odd, while c^y, d^i,, h^y, 
K^uXp multiply CPT even renormalisable operators. A phenomenological model of CPT 
violation is therefore obtained by using the Lagrangian (3.1) with the couplings aij,,by or 

non-zero. It should also be noticed that in comparing results derived with positrons 
and electrons, the substitutions —o-n, d/^^ — ^ —dfj,i, and h/^^ — ^ —h/^y should be made 

with the other couplings unchanged. 

Now, it is evident that the Lorentz-violating QED Lagrangian (3.1) and the strong 
equivalence violating extended QED Lagrangians (2.2), (2.8) have many formal similarities. 
Tensor operators appear in each, though the role of the curvature tensors in eqs.(2.2), (2.8) 
is played by the multi-index, but non-tensorial, coupling constants in eq.(3.1). Notice 
also that the Lagrangian (3.1) allows for parity- violating operators which we chose to 
exclude from (2.2), (2.8). These would not occur in the quantum effective action obtained 



r = 



)^ (3.1) 
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from QED itself since this theory is parity-preserving, but could be included in a purely 
phenomenological theory. It follows that the analysis of the phenomenological consequences 
of the two theories will be very similar, and many results and predictions can simply be 
transcribed between them. We may also hope that some extra insight may be gained from 
this comparison. The couplings which admit a direct equivalence are: 

- fsD^R/M^ 

K^^uXp ~ coR^uXp/M^ (3.2) 

Of the others, bf^,df^i, and are associated with the parity-violating operators which are 
not included in eqs.(2.2), (2.8). There is also no immediate analogue of the h^i^ term since 
Rf^i, is symmetric and cannot couple to the operator ■ipa^'^ip without higher derivatives. 

Probably the most useful correspondence is of the set of couplings K^i^xp with the 
Riemann tensor R^iyXp- K^j^xp inherits the algebraic symmetries of the operator F^^F^f, 
which exactly matches the Riemann tensor. Just as wc found it convenient to separate the 
Riemann tensor into the Ricci tensor and the tracc-frcc Weyl tensor, so it will be useful to 
split K^j^xp in the same way, defining the traced components K^x ^-nd K and the trace-free 
components cj^l^ using the same formula (2.5) that defines the Weyl tensor. We will see 
in the next section, where we go further and introduce the Newman-Penrose formalism, 
that this is a useful analogy to exploit in the analysis of the birefringent propagation of 
light. 

In the remainder of the paper, we investigate some of the experimental consequences of 
this Lorentz and CPT violating Lagrangian, emphasising wherever possible the similarities, 
and differences, with the strong equivalence violating curved spacetime Lagrangian. First, 
though, we make some remarks about radiative corrections. It will be clear that if we 
take the modified electron propagator from the Dirac sector of (3.1) and substitute it 
into the one-loop photon vacuum polarisation diagram, it will produce a change in the 
photon propagator which can be realised as an 0{a) contribution to the coefficients of the 
Maxwell sector operators. And vice-versa, modifications of the photon propagator induce 
0{a) contributions to the couplings of the Dirac sector operators. The result is that the 
electron and photon sector couplings are correlated by 0{a) radiative corrections. Exactly 
the same is of course true for the strong equivalence violating action. 

It follows that if we are able to put a bound on the Dirac couplings a^, ... , h/^i, from 
experiments on atomic spectroscopy, then this will imply bounds on the Maxwell couplings 
l^fivXp^L^ which are tested in astrophysical polarimetry experiments. And conversely, 
known bounds from astrophysics could imply bounds on future precision hydrogen and 
anti-hydrogen spectroscopy experiments, such as those planned by ATHENA and ATRAP 
at CERN. The possibility that astrophysics and atomic spectroscopy experiments could be 
correlated in this way was one of the principal motivations for this paper. 

Perhaps the most interesting of these correlations would be those involving operators 
which give rise to CPT violation in the anti-hydrogen spectrum. In particular, consider 
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the parity-violating Dirac coupling 6^. At first sight, this appears to be linked via radiative 
corrections with the coupling L'^ of the Chern-Simons operator. Since, as we see in the 
next section, the Chern-Simons term predicts a birefringent rotation of linearly polarised 
light which could in principle be observed in the synchroton radiation from radio galaxies, 
we are able to place bounds on L^^ from astrophysics. We would therefore expect this to 
imply a bound on 6^, which would limit the size of CPT violation which would be observed 
in the spectroscopic measurements at ATHENA and ATRAP. 

Curiously though, the radiative correspondence between the axial- vector operator cou- 
pling and the Chern-Simons coupling is far from straightforward theoretically. The 
question of whether a Chern-Simons term is induced via the one-loop vacuum polarisa- 
tion diagram where the fermion propagator is modified to include an axial-vector coupling 
turns out to be a subtle one in quantum field theory, involving ambiguities associated with 
the axial anomaly. A very careful analysis of this issue has been carried out by Jackiw 
and Kostelecky[19], who conclude that the question "is a Chern-Simons term induced by 
vacuum polarisation in the theory with the Dirac action modified by an axial-vector term" 
has in fact no unique answer. Depending on how we attempt to give a definition of the 
theory taking proper account of the axial anomaly, a variety of reasonable answers may be 
given, including that the induced = 0, or simply indeterminate. 

In view of this interesting but indeterminate theoretical situation, it would therefore 
be unwise to place too much confidence on a prediction of a bound for CPT violation in 
anti-hydrogen based on the bound from radio galaxies. 

4. Astrophysical Polarimetry 

We now turn to the phenomenological consequences of the strong equivalence, Lorentz and 
CPT violating Lagrangians introduced so far. In this section, we focus on the photon sector 
and in particular on the polarisation dependence of the propagation of light (including radio 
waves and gamma-rays) in astrophysics. 

4.1 Light propagation and birefringence 

We begin with a brief review of light propagation based on the strong equivalence violating 
Lagrangian (2.2). This has been studied extensively in a series of papers [1, 2, 3, 4, 5, 6, 7] 
where we have explored the possibility of superluminal propagation and its implications 
for causality as well as gravitational birefringence and dispersion. The dispersive nature 
of light propagation, which as explained in ref.[6, 7] is essential in a precise analysis of 
causality and possible superluminal signal propagation, only becomes apparent when the 
full effective action (2.2) including the higher derivative operators is used. On the other 
hand, the basic features of gravitational birefringence are already apparent in the low- 
momentum approximation to the action first derived by Drummond and Hathrell[l]. In this 
section, wc will restrict ourselves to this simple case. This is also most directly analogous 
to the Kostelecky Lagrangian (3.1) with its restriction to renormalisable operators only. 

To study the propagation of light in this theory, we use the formalism of geometric 
optics. The electromagnetic field is written in the form yl^ = ^a^expi^, where A is the 
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amplitude and is the polarisation vector. The amplitude is taken to be slowly-varying 
on the scale of the rapidly- varying phase 6. The wave- vector (equivalent up to some 
subtleties[7] to the photon momentum) is identified as A;^ = 8^6. The essential features of 
light propagation are then obtained from the equations of motion of the extended Maxwell 
action by considering the highest order terms in a controlled expansion in the rapidly- 
varying quantities. All this formalism is explained in our previous papers (see [27] for 
a pedagogical review). The result is the following equation for the wave vector and 
polarisation a^^: 

k'^a" - k.a k" - ^^^^ R\p (k^kPa" - k^k^a^^ - C^^'" pk^kfa^" = (4.1) 

where a, b, c are coupling constants given from eq.(2.2) as a = ao(0, 0, 0), etc., and we have 
explicitly introduced the Weyl tensor. The modified light cone follows immediately from 
the condition 

k g^i> — kfjku {^\pk^kP Qixv ~ Rxfik^kJ^ — C^xi^pk^k^ = (4.2) 



det 



The physical light cone for photon propagation therefore no longer coincides with the 
geometric light cones of the background curved spacetime, and so eq.(4.2) predicts that 
the speed of light will be different from the fundamental 'speed of light' constant c = 1. 
The remarkable feature implied by (4.2) is that in some cases it predicts that the speed 
of light may be greater than 1, i.e. light propagation may be superluminal. For a careful 
discussion of what this means physically, we refer to our earlier papers [6, 7]. The second 
main prediction of cqs.(4.1), (4.2) is gravitational birefringence, viz. that the physical light 
cone (speed of light) will depend on the polarisation if, and only if, the Weyl tensor is 
non-vanishing. See below for an explicit demonstration. 

As we have seen in earlier work, it is particularly useful in studying the implications 
of eqs.(4.1), (4.2) to adopt the Newman-Penrose formalism. (See ref.[4] for a relevant 
summary.) This involves introducing a null tetrad at each spacetime point via a set of null 
vectors ,fh^ satisfying A.n = 1, m.m = —1, l.m = i.fh = n.m = n.fh = 0. All 

tensors are then specified by their components in this basis. In the discussion below, we 
will take as the direction of propagation, so that A;'* = iui^. = IS a 

complex linear combination of two spacelike vectors transverse to It follows that m'^ 
and fh^ represent respectively left and right-handed circular polarisation vectors, i.e. the 
photon helicity ±1 eigenstates. The ten independent components of the Weyl tensor are 
described in the Newman-Penrose formalism by five complex scalars (i = 0, . . . , 4) , 
where 



^0 = 




^1 = 




^2 = 


-C^^Xpi^m^rh^nP 


*3 = 


-C^^Xp^^jfm^nP 


^4 = 


—C p^vxpn^'f^" '^'^'^^ 
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(4.7) 



with similar definitions for the ten independent components of the Ricci tensor. Here, we 
only need the notation $00 = —^Rfiu^^^'^- 

Applying this formalism to eq.(4.1), and choosing the polarisation to be a linear com- 
bination af^ = am^ + l3fh^ of the left and right circular polarisations, we find the matrix 
equation 

where k'^ = + ^^|^ a;^$oo and we have used the identity C^^xpi^^rrf t^rhP = 0. The 
eigenvalues give the modified light cone^ 

,2 45 + 8c 2 , 8c 2 IT I /. 
This implies a (non-dispersive) polarisation-dependent shift in the speed of light of 

for the two polarisation cigcnstatcs. We can now show that if we define a phase ?? from the 
(complex) N-P scalar, = |^'o|e*^, then the eigenstates are 

a^' = ^ 

The states which propagate with a well-defined velocity are therefore superpositions of the 
left and right circular polarisations with equal and opposite phases determined by the Weyl 
tensor. They are therefore orthogonal linear polarisations, the direction being determined 
by the Weyl phase. 

It should now be reasonably clear that we can equally well apply this formalism to 
the case of the Lorentz violating phenomenological theory in flat spacetime. (The formal 
discussion in the remainder of this section and in section 4.2 should be compared with the 
treatment of photon propagation in refs.[14, 18], where similar issues are addressed without 
using the Newman-Penrose formalism. The main physics conclusions are the same.) As 
already noted, the coupling K^^xp appearing in the Kostelecky Lagrangian (3.1) plays 
essentially the same role as the Riemann tensor in the above discussion. In particular, it 
has the same number of independent components as R^uXp by virtue of having the same 
algebraic symmetries, which arc inherited from the operator it multiplies in the Lagrangian. 
We can therefore introduce the Newman-Penrose formalism in this context also and define 

(K) (K) 

N-P scalars ^'^ (i = 0, . . . , 4) from the trace-free analogue C^^^;^p of the Weyl tensor (and 
similarly for the N-P analogues from the traced components K^i,). This helps to put 
some order into the plethora of components of K^i^xp and extract the essential physics. 
All the results derived above can now be directly translated. In particular, provided the 

(K) 

relevant components of C^^xp non-vanishing, there will be a birefringent velocity shift 
analogous to eq.(4.6), viz. 



5v = -4 ± 4 (4.8) 



^Eq.(4.5) corrects the corresponding equation in ref. [4], where a similar expression was quoted that, 
although written in terms of VPo and ^'oi "^as only valid, and was only used, for the case of real ^0. 
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The velocity eigenstates are given by superpositions of the left and right circular polari- 
sations as in eq.(4.7), where now "d is the phase of ^'g^^- They are therefore orthogonal 
linear polarisations with orientation determined by 

Light with a general linear polarisation is a superposition of these eigenstates. As 
they propagate through some distance D, the eigenstates will pick up a phase difference 
5(f) = ib^^(5f, where A is the wavelength. The resulting superposition is then no longer 
a pure linear polarisation, but will be elliptically polarised.^ For a generic orientation of 
linear polarisation, the major axis of the ellipse will be rotated relative to the direction of 
linear polarisation by an angle of 0{5(t)). The ratio of minor to major axes of the ellipse 
is also 0{5(f)), the exact expressions in each case depending in a non-trivial way on the 
angle between the polarisation directions of the light wave and the velocity eigenstates. 
The transformation of linear to elliptic polarisation with a dependence on the propagation 
distance D is therefore a clear signal for Lorentz breaking through a 'Weyl' mechanism 
arising from the couplings -ftT^i/Ap in the phenomenological Lagrangian. 

A similar analysis also applies to the Chern-Simons interaction in the Lagrangian (3.1). 
In this case, the appropriate equation of motion is 

k'^a" - k.a + ie^.^xpL^'k^a'' = (4.9) 

The modified light cone is derived from 



det 



(4.10) 



k g^u kfxku i^ni/XpL k^ — 

which, for physical polarisations, implies[17] 

k'^ + k^L'^ -{L.kf = (4.11) 
To first order in L'^, this gives a birefringent phase velocity shift^ 



5v = ±WL.k (4.12) 



The eigenstates themselves follow from eq.(4.9). As before, choose the physical transverse 
polarisation vectors to be linear combinations of the left and right circular polarisations, 
i.e. = am^ + (3rh'^. In this sector, the equation of motion becomes 

e-ie^uXpm^^fh'^k^^LP \ (''\ = (ai'^) 

e - i€^^xprn^'m''k^LP J \(3j \0 J ^' ' 



^This is the standard situation in optics of superposing two orthogonal plane waves with a phase differ- 
ence. The resulting figure swept out by the polarisation vector is in general an ellipse, but degenerates to 
a straight line for special values (0, n, 27r, . . .) of the phase difference. If the initial two waves have different 
amplitudes, then the orientation of the principal axes of the ellipse, as well as the ratio of its major to 
minor axes, depends both on this amplitude ratio as well as on the phase difference. The resulting ellipses 
are the simplest examples of Lissajous figures. See, for example, ref.[28] for a detailed description. 

^Notice that, unlike the 'Weyl' case, eq.(4.11) implies a non-trivial dispersion relation. In particular, 
while the phase velocity satisfies eq.(4.12), the shift in the group velocity is second order in L^, i.e. = 
1 + 0(L2) [17]. 
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We see immediately, that in contrast to the 'Weyl' case above, the symmetries of the 
antisymmetric tensor ensure that this matrix is diagonal and therefore the left and right 
circular polarisations are themselves the eigenstates, i.e. the states which propagate with 
a well-defined velocity. We can readily check that the light cone condition following from 
eq.(4.13), viz. 

= ± ie^^xp m^'myk^LP (4.14) 

reproduces cq.(4.11). 

As noted above, linearly polarised light may be considered as a superposition of the 
left and right circular polarisations with a phase difference which determines the direction 
of linear polarisation. If the left and right circular polarisations propagate at different 
speeds, then this phase difference will vary with the distance propagated, thereby rotating 
the direction of linear polarisation. This birefringent rotation angle is half the induced 
phase difference between the left and right circular polarisations due to their velocity dif- 
ference. Linearly polarised light propagating through a distance D will therefore experience 
a birefringent rotation of the direction of polarisation given by 

5(1) = ^ 5v (4.15) 
A 

where 5v is given by eq.(4.12). 

We see, therefore, that there is a distinct difference between the birefringence effects 
associated with the 'Weyl' (*(^)) and Chern-Simons (L'^) types of Lorentz breaking. While 
the latter simply rotates the direction of linear polarisation, the former in addition turns 
linear polarisation into elliptic polarisation. Either effect is a clear signal for Lorentz 
violation; the presence of the latter is characteristic of the parity and CPT violating Chern- 
Simons interaction. 

Clearly, since 5v is expected to be tiny given that we know Lorentz invariance to 
be a good symmetry to high accuracy, in order to achieve measurable effects we need 
the longest possible baseline D, i.e. we need to study deep astrophysical sources emitting 
linearly polarised radiation for which we can associate a definite direction. We now consider 
two possible types of source. 

4.2 Radio galaxies 

A suitable set of astrophysical sources for testing this Lorentz-violation induced birefrin- 
gence are radio galaxies [17]. These emit synchrotron radiation, which is linearly polarised. 
Moreover, for a large class of these objects, it is possible to identify an axis along which 
they are elongated, and models suggest that the alignment of their magnetic field deter- 
mines the orientation of the emitted linear polarisation to be either along, or orthogonal 
to, this axis. 

Carroll, Field and Jackiw[17] have carried out a detailed study of a sample of 160 such 
radio galaxies, with redshifts typically in the range z ^ 0.1 to 2; ~ 1. Making cuts to those 
with polarisations above 5% and with z > 0.4, and after compensating for the effects of 
Faraday rotation by the intcrgalactic magnetised plasma, they find clear evidence that the 
observed orientation of the linear polarisation is indeed peaked around 90° relative to the 
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axis of the radio galaxy. The mean angle is found to be 90.4° ib 3.0°. They conclude 
that at the 95% confidence level, the birefringent rotation angle implied by a Chern-Simons 
mechanism is 6(f> < 6.0°. 

To convert this into a bound for the Chern-Simons coupling, we need to convert 
from the redshift to the proper distance D. For an O = 1 matter dominated universe, 
this gives D = 3^(^1 — (1 + where Hq is the Hubble constant. Letting h = 

Fo/lOO km sec"^ Mpc~\ the recent WMAP data[29] determines h = 0.72 ± 0.05. It 
follows from eqs.(4.12), (4.15) that 

S(f) = L.k (4.16) 

= 3^ (l-(l+z)-i) L.k (4.17) 
from which, taking z ^ 0.4, we deduce the following bound on the Chern-Simons coupling[17] 

\L.k\ < 1.2 X 10-^2 GeV (4.18) 

Alternatively, from the discussion following eq.(4.8), we can use the observation of 
linear (rather than elliptic) polarisation, with the predicted orientation, to estimate a 
bound for the dimensionless 'Weyl' coupling ^q^^ obtained from the couplings K^y\p in 
the phenomenological Lagrangian (3.1). With the above bound on dcj), and a typical radio 
galaxy wavelength of A '-^ 20cm, we can use eq.(4.15) together with (4.8) to obtain the 
following estimate: 

< 0(10-29) (4.19) 

Notice that because of the qualitatively different dispersion relations for the 'Weyl' 
and Chern-Simons mechanisms, i.e. 5v ~ const for 'Weyl' (eq.(4.8)) and 5v ~ l/k for 
Chern-Simons (eq.(4.12)), the birefringent rotation angle ~ DSv/X is independent of the 
frequency of the radiation for Chern-Simons but is proportional to frequency for the 'Weyl' 
CclSG, clS used above. ^ It follows that in order to bound the 'Weyl' coupling, we can look 
not only for sources with long baselines D, but also study them at shorter wavelengths. 

Taking advantage of this, a more recent estimate has been made by Kostelecky and 
Mewes[18] using a smaller sample of higher redshift galaxies and quasars for which polari- 
metric measurements are available in the infra-red, optical and UV. Prom this, they are 
able to quote an improved bound of 0(10-^^) on a typical component of K^^j^xp- 

4.3 Gamma-ray bursts 

As noted above, one way to improve these bounds is to increase the baseline distance D by 
studying polarisation in astrophysical sources at higher redshifts, while for some couplings, 
observing sources at higher frequencies will also help. An intriguing new possibility in this 

^It should be noted, however, that the constant dispersion relation in the 'Weyl' case is a consequence of 
only retaining the renormalisable dim = 4 operator in the extended QED Lagrangian. In the general case 
where higher-dimensional operators are considered as well, as in the strong equivalence violating Lagrangian 
in section 2, the corresponding dispersion relation is highly non-trivial[6]. 
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direction has been opened up in the last few years by the detection of linear polarisation in 
gamma-ray burst (GRB) afterglows, i.e. the fading optical phase observed after the initial 
prompt burst of gamma-rays, and recently in the prompt gamma-ray phase itself. 

The first observation of polarisation in GRB afterglows was made in 1999 in GRB990123, 
which has a redshift of 2; = 1.60. Since then, around ten further observations have been 
made. (Sec rcf.[30] for a complete list.) Amongst these, the highest redshift recorded so 
far is z = 2.33, for GRB021004. 

It is very likely that in the next few years increasingly precise observations of GRBs, 
notably with the forthcoming launch of the SWIFT satellite, will substantially increase 
the data set of observations of polarisation in GRB afterglows. Eventually we may hope 
that polarisation measurements on GRBs will evolve from their present role as probes of 
the basic mechanism of GRB production to tools which we can exploit to constrain the 
fundamental physics of light propagation. If GRB polarisation does indeed come to be 
as well understood as the polarisation from radio galaxies discussed above, the increase 
in redshift by a factor of around 10 will allow us to tighten the bounds (4. 18), (4. 19) still 
further. 

In order to realise this programme, we need first of all to observe intrinsic linear 
polarisation in high redshift GRBs, and also to understand its direction in terms of the 
geometry of the source/observer so that wc can place bounds on any birefringent rotation. 
We also need to understand the effect on polarisation of the transmission of the afterglow 
light through the host galaxy (see e.g. refs.[31, 32], a challenging task analogous to the 
subtraction of Faraday rotation in the intergalactic medium in the analysis of polarisation 
with radio galaxies. It will clearly be some time before all this can be achieved, but our 
purpose in this section is simply to draw attention to the possibility of using GRBs as 
controlled long-baseline light sources for the study of fundamental physics such as the 
possible violation of Lorentz invariance. 

As an illustration of how directed, linear polarisation can arise in GRB afterglows^, 
we consider a simple model proposed by Ghisellini and Lazzati[33]. Several other models 
can be found in the literature, which differ in important respects and should be readily 
distinguishable as more data becomes available.^ The key features in many cases though 
are common: (i) GRBs arise through the ejection of a jet of relativistic matter in a narrow 
cone from a central source, almost certainly a supernova. This jet structure removes spher- 
ical symmetry and provides the essential axis for the source analogous to the elongation 
direction of the radio galaxies, (ii) Synchrotron radiation is emitted from the relativistic 
electrons accelerated in the magnetic field of this jet, with the radiation emitted at 90° 

®In what follows, we are describing a mechanism for polarisation in GRB afterglows, where it seems to 

be accepted that the relevant production mechanism is synchrotron radiation. Polarisation of the prompt 
gamma-rays in a GRB has recently been observed in GRB021206. Here, authors appear divided on whether 
the origin is synchrotron radiation or inverse Compton scattering. See ref. [32] for a brief summary. 

'^We do not enter here into the comparison of models, which differ in such important features as whether 
the magnetic field in the jet of ejected matter is completely tangled or partially ordered, whether the jet is 
homogeneous or structured, and whether or not the jet cone opening angle is itself sufficiently small that 
only the angle to the observer's line of sight is relevant. See, for example, refs.[34, 32] for an introduction 
to the literature. 
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being linearly polarised, (iii) This synchrotron radiation is Lorentz boosted by the usual 
relativistic aberration mechanism into the forward direction where it can be received by 
the observer, (iv) Crucially, observations are made off the axis of the jet cone. This 
introduces the asymmetry necessary to observe a non- vanishing net polarisation. 

Consider then a shell of magnetised plasma moving outwards relativistically in a jet 
making a cone of half-angle 9c- Synchrotron radiation is emitted from the shell with the 
radiation emitted in its plane, i.e. at 90° to the outward motion, being linearly polarised. 
This jet is observed off-axis by an observer whose line of sight makes an angle Og with the 
jet axis. See Figure 1. 




jet axis 



observei 
axis 



Figure 1: A jet comprising a shell of magnetised plasma is emitted radially outwards forming a 
cone of half-angle Oc- It is observed off-axis by an observer whose line of sight makes an angle 
with the jet axis. 

Because of relativistic aberration, photons emitted in the plane of a segment of the 
shell, in its comoving frame, are observed in the forward direction making an angle ~ I/7 
with the normal, where 7 is the Lorentz factor. See Figure 2. This means that the observer 




observei 
axis 



Figure 2: Polarised light emitted at right angles to the outward radial motion of a section of the 
plasma shell is relativistically aberrated into the forward direction, where it makes an angle ^ ~ I/7 
with the normal to the shell. It is observed at the edge of the circle subtending a cone of half-angle 
9 about the observer's line of sight axis. 

receives linearly polarised light from the edge of the circle subtending a cone of half-angle 
9 around the line of sight Oq- The polarisation direction is radial with respect to this circle. 

If the observer's cone lies entirely within the jet cone, then this whole circle is observed 
and the net polarisation is zero through cancellation of the radially-directed polarisation 
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around the circle. As 9 increases, however, only a part of the observer's cone overlaps 
with the jet cone. In this case, only part of the circle is observed and a net polarisation is 
measured. See Figure 3. Finally, as increases still further, the observer's cone completely 
envelopes the jet cone and again no polarisation is observed. 

The criterion for observing a net linear polarisa- 
tion is therefore 

Be + 60 > e > Oc-Oo (4.20) 

that is, 

6c + Oo ^ Oc — ^ ^ 

Notice that this is only possible if 9o 7^ 0, i.e. if the 
GRB jet is viewed off-axis. 

To summarise, at the beginning of the afterglow 
when 7 is large, the observer sees only a fraction of 
the full jet (7 > l/6c — Op) and no net polarisation 
is observed. As 7 drops as the afterglow evolves, we 
enter the asymmetric stage where the observer's cone 
overlaps partly with the jet cone - this results in an 
observed net polarisation. As 7 falls still further (7 < 
l/9c + 9o), the entire jet cone becomes visible and the 
polarisation vanishes. 

This model also makes a definite prediction for 
the direction of polarisation. The derivation is given in detail in ref.[33] and consists of 
integrating the Stokes' vectors Q and U around the visible arc of the observer's circle. The 
result, which is a peculiarity of this precise geometric model, is that the polarisation is 
initially orthogonal to the plane containing both the jet cone axis and the line of sight, 
before flipping abruptly to lie in this plane. All models of this type, however, make a 
well-defined prediction of the direction of polarisation. 

For our purposes, the crucial point to be extracted from this model is that we should 
expect to observe a net linear polarisation, with a specified direction established with 
respect to the plane formed by the jet axis and the observer's line of sight, in a situation 
where a GRB jet is observed off-axis (as will generically be the case). The existence, and 
possibly direction, of this linear polarisation is expected to change with time as the Lorentz 
factor 7 reduces as the expanding jet slows. 

GRB afterglows therefore satisfy the criteria we need for the long-baseline astrophysical 
polarimetry measurements which can test Lorentz and CPT symmetry to high accuracy. 
Moreover, if polarisation measurements meeting these criteria can eventually also be made 
on the prompt gamma-rays themselves, we could also improve the bounds on the 'Weyl' 
coupling because of the extremely small wavelength factor in the birefringent rotation 
formula, as discussed above. 




Figure 3: The circle C subtends the 
jet cone. Circles Ci and C2 sub- 
tend the 'observer's cone' of half-angle 
^ 1/7 as increases. Initially Ci 
lies entirely within C, while for lower 
values of the Lorentz factor 7 there is 
only a partial overlap, enabling a net 
linear polarisation to be observed. 
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This combination of large rcdshifts and high frequencies means that GRBs have the 
potential ultimately to give the most accurate limits on the Lorentz and CPT violating 
couplings l^f^^^l and 

5. Precision Atomic Spectroscopy 

While testing the Maxwell sector of the Lorentz and CPT violating theory requires obser- 
vations on astronomical scales, testing the Dirac sector is the domain of precision atomic 
physics experiments. In this section, we will consider the implications of the phenomeno- 
logical Lagrangian (3.1) for precision atomic spectroscopy, in particular with hydrogen and 
anti-hydrogen. The possibility of making spectroscopic experiments on anti-hydrogen and 
looking for CPT violation follows from the production for the first time of substantial num- 
bers of cold anti-hydrogen atoms at the ATHENA[20, 21] and ATRAP[22, 23] experiments 
at CERN in 2002. This subject has already been investigated in detail by Bluhm, Kost- 
elecky and Russell [24], but here we shall push their analysis a little further to see whether 
there are further options for CPT tests which could be realised at ATHENA and ATRAP. 

5.1 Hydrogen spectrum with Lorentz and CPT violation 

Our starting point is the textbook derivation of the relativistic fine structure of hydrogen 
energy levels starting from the Dirac equation, rather than the more familiar derivation 
based on perturbation theory and the non-relativistic Schrodinger equation. In outline, 
this derivation goes as follows. The non-relativistic Schrodinger equation for hydrogen 
reduces to 

2m\dr'^ r dr r"^ J r 

where the wave function il) = exp{—iEt)ijjni{r) , L'^tpnt = ^(^ + l)V'n£) ^^iid the energy 
eigenvalues are E = —^ma^/n?, where the integer n satisfies £ = 0,1, . . . ,n — 1. 

We now reduce the full QED field theory to one-particle relativistic quantum mechanics 
in the usual way, regarding the Dirac equation as a relativistic wave operator, in a classical 
background field, acting on a four-component wave function ip, i.e. 

(i-r^D^ - m) V = (5.2) 

where D/^ = + leA^. Now act on this with the Dirac operator with m — —m. This 
gives 

(-D"^ - ea^'T^^ -m^^ ip = (5.3) 

Substituting the Coulomb potential eAo = Ai = 0, and using the explicit expressions 
for the 7 matrices^, this becomes 



Ml) = (5.1) 



/ ^2 2 a \ _ L2 /a\2 ^2 2 .oifa.f 

\dr^ r dr J \r ' r r^\Q —a.r 



V' = (5.4) 



^Our conventions for the 7 matrices are: 
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which separates into two almost identical equations for the two-component spinor wave 
functions in the decomposition tp = ^ , viz. 

-> 1 n, ^ .1 

V'i = (5.5) 



+ -^-1 - - ± iaaAi. + 2E- + - m'^ 
\ ar^ r or J y. J r 



We can show[35] that the eigenvalues of the operator {L — • • •} can be written as A(A + 1) 
where A = [{j + i)^ — a^]^ , [{j + 1)2 _ a^]^ — 1 for j = ^ ± 1 respectively. In each case, 
this gives X = i — 5 where 5 = 2J+1 ^ + O(a^). 

Now, comparing eq.(5.5) with the non-relativistic Schrodinger equation (5.1), we de- 
duce by inspection that the energy eigenvalues in the relativistic equation satisfy 

E^-rn^ = - . (Eaf (5.6) 

[n — 0)'^ 

Expanding in powers of a, we recover the familiar expression for the fine-structure correc- 
tion to the hydrogen energy levels in the relativistic theory: 

Eps = m - Ima^ ^ + '-ma' -L (i. _ + 0(«6) (5.7) 

We now consider the implications of changing the Dirac equation to the extended one 
implied by the Lorentz- violating Lagrangian (3.1), viz. 

(iji'D^ -m- a^j'' - b^j^'' + ic^u^D" + id;,.7^7''D'' - ^ V^''") ^ = (5-8) 

Multiplying as before by the equivalent operator with m —m, we find that the extended 
Dirac one-particle relativistic wave equation analogous to eq.(5.3) is 



= (5.9) 



To see the effect of the new Lorentz and CPT violating couplings, consider first just 
adding the a^^ and terms. Using the explicit expressions for the 7 matrices, we find, to 



where a are the Pauli matrices, and 

LIU \ II I/l 

which implies 

Oi i / Cr' \ ii 1 ijkf ffc 

" =-2^0 -a^J " =2' io a. 
We also need the following identity in the derivation of eq.(5.4): 

{7,0-}= - e 7 7p 
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first order in the small couplings afj_,bn: 



or^ r or I r-' \ r J r v' 



+2Eao + 2ao^ - h.a-\^ b.a 



— £■£ , 

ip + O(VV') = (5.10) 



We now come to the key step. Comparing eqs.(5.4) and (5.10), we see by inspection that 
to first order in the couplings, the energy eigenvalues are simply related by 

Ei/j (^E + ao-^b.a^i^ (5.11) 

That is, introducing the spin operator S_ = ^a, the energy eigenvalues in the Lorentz 
violating theory are 

E = Efs + ao - ^ {b.S) (5.12) 

Extending the analysis to include the remaining couplings, we readily find the complete 
result 

E = Eys + {ao + mcQo) - -{h-mdio + eijkW^) {S') (5.13) 

To complete the derivation, we need the matrix element ( 5* ) taken in the basis 
of states \n£sjmj ) appropriate for fine structure.^ In fact, this is well-known from the 
perturbative analysis of the Zeeman effect. The result is 

( nisjnijl Sz \nisjmj ) = g rrij (5-14) 



^The evaluation of the matrix element of the spin operator ( n£sjmj\ Si \nlsjmj ) proceeds as follows. 
Abbreviate the notation \n£sjmj ) to \jmj ). Prom the definition of the Clebsch-Gordon coefficients, 

) = ^ \mems ) { nnms\jmj ) 

7ng,7ns 

we readily find 

( j'm'jl S;, \jmj ) = rus { j'm'j\mj -rua, rus) {rrij -rus, ma\jmj ) 

ms 

while 

( j'm'jl S± \jmj ) = ^ \/(s - ma){s + + 1) ( j'm'j\mj - m,, rua + 1 } { ruj - rria, ms\jmj ) 

For s = i, and using the known expressions for the Clebsch-Gordon coefficients [36], we then find for 
m'j = rrij and the two possibilities f ,j = £ ± ^, 

(l±^ niji S:, \l±i nij ) = ± -T^—r mj 



'+2 



1\2 



2 " ' ' 2 ' ' 2e+i 
while { S± ) — 0. The diagonal matrix element is the result quoted above. 
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where g is the Lande ^-factor, 

1 



2j(j + 1) 
while {S±) =0. For s = i, 



(j{j + l)+s{s + l)-e{e + l)) (5.15) 



( nisjrrijl \nisjmj ) = ± ^ rrij (5.16) 

for the two possibiUtics j = C- ^ \. Putting all this together, we finally find the fol- 
lowing result for the energy eigenvalues for hydrogen in the Lorentz and CPT violating 
phenomenological theory (3.1): 

E = Eps + (ao + mcoo) - ^{h - mdso + hi2) (±) ^^^ rrij (5.17) 

for j =£±i. 

The important new feature here compared with the result quoted in ref . [24] , which was 
concerned purely with the Is — 2s transition involving only £ = states, is the i dependence 
in the general case arising from the Lande (/-factor. 

Notice also that since the contribution of the and c^i, couplings to the energy levels 
is independent of the angular momentum and spin quantum numbers, they do not affect 
energy differences between states so play no role in determining the transition frequen- 
cies. The couplings which do have the potential to change the transition frequencies are 
b^jdi^i^ and /i^,/. These are the parity- violating couplings which did not have analogues in 
the strong equivalence violating QED Lagrangian discussed earlier (see the dictionary in 
eq.(3.2).) In order to have curvature-induced effects on atomic spectra, we would need to 
go beyond the extended QED model of section 2, either by introducing parity- violating 
couplings on a purely ad-hoc basis into the phenomenological Lagrangian or by embedding 
QED in an extended strong equivalence violating standard model with electroweak parity 
violation. 

5.2 Is — 2s and 2s — nd {n ^ 10) transitions in H and anti-H 

In order to test for Lorentz and CPT violation in atomic spectroscopy, at least in the 
context of this model, we need to compare transition frequencies which are sensitive to the 
couplings 63,(^30 and hi2- One promising approach is to compare the frequencies of the 
equivalent spectral lines in hydrogen and anti-hydrogen, now that the abundant production 
of cold anti-hydrogen atoms at ATHENA[20, 21] and ATRAP[22, 23] has made precision 
spectroscopy on anti-hydrogen feasible for the first time. 

As noted in section 3, the couplings change in the following way under charge conju- 
gation: -a^, bfj,, c^^, d^„ -d^„, -h^^. Transition frequencies 
sensitive to the combination (63 — md^Q + h^) will therefore be different for hydrogen and 
anti-hydrogen. Measuring such a difference in their spectra would be a clear signal for 
Lorentz, and possibly CPT, invariance. 

So far, attention has focused on the Is — 2s transition. This is a Doppler-free, two- 
photon transition: two-photon since it violates the usual single-photon, electric-dipole 
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selection rule = 1 and Dopplcr-free since the recoil momentum from the emission of the 
two photons cancels. The 2s state is therefore exceptionally long-lived (lifetime 122ms) and 
the Is — 2s spectral line has an ultra-narrow natural linewidth of 1.3Hz, giving a resolution 
of O(10~^^). (A = 243nm for the two-photon Is — 2s transition.) It has been suggested 
that an even more accurate measurement of the line centre to ~ ImHz, corresponding to 
an ultimate resolution of one part in 10^^, is in principle attainable [37, 38]. The Is — 2s 
transition is therefore favoured for precision spectroscopy. 

The selection rules for two-photon transitions are derived in ref.[39]. The principle is 
straightforward. The transition amplitude depends on the product of two vector dipole 
operators, which is decomposed as the sum (r° -|- T^) of scalar and rank-two irreducible 
tensor operators, where the quantum numbers q are correlated with the helicities of the 
emitted photons. For the special case of ^ = to £ = transitions, only the scalar operator 
Tq can couple the two states and the resulting selection rule is 

Aj = 0, Am^- = (^ = 0^^ = 0) (5.18) 

Applied to the Is — 2s transition, the important fact for us is the constraint Amj = 0. 
According to eq.(5.17), the Is and 2s energy levels receive the same corrections if they have 
the same rrij. The selection rule Am^ = therefore ensures that the Is — 2s transition 
frequency is unchanged by the Lorentz and CPT violating couplings. 

To overcome this problem, Bluhm, Kostelecky and Russell[24] have made a detailed 
study of the Is — 2s hyperfine Zeeman transitions for hydrogen and anti-hydrogen. We shall 
only briefly summarise some of their results here; see ref.[24] for full details. In particular, 
they exploit the n-dependence of the hyperfine splitting for a certain spin-mixed state to 
show that the corresponding Is — 2s hyperfine transition receives a frequency shift due to 
the Lorentz and CPT violating couplings of 

5v^ ~ K{hl-^,- medio + mpd^Q - + h^^) (5.19) 

where k is a combination of magnetic field dependent mixing angles. Notice that we have 
included the corresponding couplings for a modified Dirac equation for the proton as well 
as the electron in cq.(5.19). Since the corresponding hyperfine states in anti-hydrogen have 
the opposite spin assignments to hydrogen, the result in this case is 

Su^ ~ K (^-bl + 6§ - medio + mpd^Q - hl^ + (5.20) 

An observation of this hyperfine transition in hydrogen would therefore exhibit sidereal 
variations because of the frame dependence implicit in the Lorentz violating couplings. 
Moreover, if we could compare with the equivalent transition in anti-hydrogen, there would 
be an instantaneous difference 

Ai^-^ - Az/^ ~ 2k {hi - (5.21) 

This would provide a direct measurement of the CPT-odd coupling 63. A positive ob- 
servation would therefore be a signal of the violation of CPT symmetry. If Az/ could be 
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measured with an accuracy comparable to the ultimate line centre of ImHz, this would 
place a bound of < 10~^''GeV on the CPT-violating coupling^°. 

The magnetic field needed to resolve the hyperfine states is naturally present because 
cold anti-hydrogen is produced in magnetic fields in the ATHENA and ATRAP experiments 
and it is likely that spectroscopic measurements will be made on anti-hydrogen atoms 
confined in a magnetic trap. (For details, see e.g. ref.[42].) However, the inhomogeneous 
nature of the trapping fields produces Zeeman broadening of the spectral lines and limits 
the resolution that can actually be achieved in practice. It is estimated in ref.[24] that this 
could broaden the Is — 2s hyperfine spectral line described above to over IMHz width, 
which corresponds to an actual experimental resolution of only one part in 10^. In ref.[38], 
the possibility of achieving a width of 20kHz is envisaged. Reducing Zeeman broadening 
and developing techniques to determine the line centre to even higher accuracy in order 
to approach the theoretically limiting resolution of the Is — 2s transition is therefore an 
important experimental challenge [42]. 

In the remainder of this section, we will investigate an alternative to Is— 2s for precision 
studies of Lorentz and CPT violation, exploiting the ^-dependence in the Lande ^-factor 
in the formula (5.17) for hydrogen energy levels. Of particular interest is the Doppler-free, 
two-photon 2s — nd transition for n ~ 10. This turns out to be surprisingly competitive 
with the Is — 2s transition for practical precision spectroscopy. Indeed, at the time of 
writing of the comprehensive review of hydrogen spectroscopy ref.[43], the 2s — 8, 10, 12d 
transitions provided the most accurate determination of the Rydberg constant, to one part 
in 10^^ [44]. In fact, the limitation on further improvement comes not from the natural 
width of the spectral lines but from the accuracy of the optical laser frequencies. The 
2s — lOd natural linewidth is 296kHz while the experimental linewidth reported in ref.[45] 
is 1.25MHz. 

It seems possible, therefore, that in realistic experimental conditions, measurements of 
the 2s — nd (n ~ 10) transition in free hydrogen and anti-hydrogen (see ref.[46] for 'atomic 
fountain' techniques to study free cold atoms outside magnetic traps) could be made with 
a high precision complementing the Is — 2s hyperfine Zeeman transition^^ . The loss of 
accuracy due to the broader natural linewidth of 2s — nd is compensated by the absence 
of the Zeeman broadening which afflicts the Is — 2s hyperfine transition, in addition to a 
variety of other experimental limitations including the accuracy of the laser frequencies at 
the required wavelengths. Naturally, though, a much more detailed experimental analysis 
would be required to confirm whether or not the 2s — nd transition is really competitive, 
especially in the case of anti-hydrogen. 

Our interest in the 2s — nd transition stems of course from the fact that invoking 
an £ = 2 state allows us to exploit the ^-dependence in the energy level formula (5.17). 
While the 2s state has only j = i, = ±i quantum numbers, the nd state allows 

^'^A compilation of similar bounds from a variety of atomic physics experiments is given in ref.[40]. See 
also ref.[41] for a review invoking a broader range of Lorentz violating models. 

^^The case n = 11 may be of particular interest in the ATHENA programme as anti-hydrogen atoms 
in this state will be produced using appropriate laser frequencies to induce transitions from the high— n 
Rydberg states in which they are originally formed. 
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j = |, ruj = ib|,ib^ and j = |, mj = ib|,ib|,ib^. Eq.(5.17) therefore predicts a non- 
zero contribution from the Lorentz and CPT violating couplings at the level of the basic 
spectral line, without needing to introduce a magnetic field to produce hyperfine Zeeman 
splittings. 

The selection rules for 2s — nd are as follows. For A£ ^ 0, the scalar operator r° gives 
no contribution, so the selection rules follow from the two-photon matrix elements of the 
rank-two tensor operator T^. The selection rules for the A£ = 2 transition are[39] 

|Aj| < 2, Arrij = qi + q2 (Ai = 2) (5.22) 

where qi, q2 are the photon helicities. Transitions between nearly all the j, mj states above 
are therefore possible, with the only restriction |Amj| < 2. 

The frequency shift induced by the new couplings is therefore^^ 

Au = - ^(b3 ~ mdao + hu) [±^"^f - "^f] (5-23) 

where the it refers to j = | and | respectively. Exploiting this dependence on the quantum 
numbers, successive measurements on hydrogen alone can therefore determine the combi- 
nation (63 — rnd^Q + hi2). In addition, there will be sidereal variations due to the frame 
dependence associated with Lorentz violation. 

The same measurements on anti-hydrogen would determine the charge conjugate con- 
tribution (63 -|- mdso — hi2)- Comparing the hydrogen and anti- hydrogen results therefore 
enables us to make an independent determination of the CPT-violating coupling 63 and 
the Lorentz- violating, but CPT-preserving, combination md^o — hi2- 

In conclusion, the transition 2s — nd in free hydrogen and anti-hydrogen allows us to 
exploit in full the quantum number dependence of the general energy level formula (5.17). 
In realistic experimental conditions, taking account of issues such as Zeeman broadening 
in the magnetic trap required to study the hyperfine Zeeman Is — 2s transition and the 
limitations from laser frequency accuracy at the relevant wavelengths, it may even prove 
comparable in resolution with the Is — 2s transition. In any case, it provides an interesting 
complementary measurement in anti-hydrogen spectroscopy. 



6. Conclusions 

In this paper, we have investigated the construction and experimental implications of phe- 
nomenological extensions of QED exhibiting violations of the strong equivalence principle, 
Lorentz invariance or CPT symmetry. The underlying theme has been the formal similarity 
between these models. Both are essentially theories of the phenomenology of Lorentz non- 
invariant operators. In Kostelecky's Lorentz-violating model, these operators arise with 
multi-index couplings (perhaps to be interpreted as VEVs of tensor operators in a more 
fundamental theory of spontaneous Lorentz symmetry breaking), which may be bounded by 
experiment. In the strong equivalence violating models, this role is played by the curvature 
tensors of the background gravitational field. 

'^^For simplicity of presentation, we are neglecting the effect of the nuclear proton spin here. 



-29- 



Modifications to the Maxwell sector of QED can be tested to high accuracy using 
long-baseline polarimetry observations of astrophysical sources. After reviewing existing 
limits on birefringent propagation obtained from radio galaxies and quasars, we have shown 
how in future polarisation measurements on gamma-ray bursts at high redshift have the 
potential to enable even more stringent bounds to be placed on the Lorentz-violating 
Maxwell couplings K^jj^xp and L^^. 

In the Dirac sector, experimental tests of Lorentz and CPT violation can be made 
with high precision atomic spectroscopy comparing hydrogen and anti-hydrogen, the latter 
having become feasible following the recent production of cold anti-hydrogen atoms in 
significant numbers by the ATHENA and ATRAP collaborations. On the basis of an 
extended formula for atomic energy levels, we have proposed that the transition 2s— nd (n ~ 
10) in free hydrogen and anti-hydrogen may provide a useful complement to the currently 
envisaged programme of spectroscopy on the Is — 2s hyperfine Zeeman transition. 

One of the original motivations for this paper was to investigate whether the constraints 
on these models obtained from astrophysics could be used to bound the couplings giving 
rise to Lorentz and CPT violation in atomic spectroscopy, and vice-versa. At first sight, it 
appears that, in particular, the Chern-Simons photon coupling L'^ and the CPT-violating 
electron coupling should be related through radiative corrections. If so, this would 
have important implications for the atomic physics experiments since, as wc have seen, the 
existing bounds from radio galaxies constrain < O(10^''^)GeV while, even under the 
most optimistic experimental scenario, anti-hydrogen spectroscopy can only fix a bound 
of \bn\ < O(10'~^^)GeV. However, ambiguities in the specification of the quantum theory 
based on the Kostelecky Lagrangian arising from the axial anomaly make the relation 
between 5^ and strictly indeterminate, though perhaps it could still be argued that we 
would naturally expect them to be of the same order of magnitude. On the other hand, 
where the relation between couplings is free of anomaly ambiguities, as we would expect 
to be the case with a^,c^i, and K^^xp, the Dirac couplings do not affect the frequency of 
atomic transitions. 

The strong equivalence principle, Lorentz invariance and CPT symmetry are all fun- 
damental ingredients of conventional quantum field theories of elementary particle physics 
and cosmology. Nevertheless, as we have seen, it is relatively straightforward to modify the 
dynamics of QED or the standard model to exhibit violations of each of these principles and 
to predict corresponding experimental signatures, either in atomic physics or astronomy. 
Hopefully, these theoretical considerations will help to stimulate further high-precision ex- 
perimental work to test their validity to still greater accuracy. 
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